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ABSTRACT 

The report contains the asymptotic efficiencies of some candidate 
estimators which provide alternatives to maximum likelihood in some common 
probabilistic settings. The alternative estimators can be found with 
measurably less effort than solving the likelihood equations. They include 
the method of moments and similarly constructed estimators that involve 

the harmonic mean. The most successful example found deals with the 
negative binomial distribution. Here, the harmonic mean estimator has high 
efficiency in regions where the method of moments estimator has rather low 
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I. Introduction 



The need for readily computed parameter estimates is great. 

Maximum likelihood estimators are known to be asymptotically efficient, but 
often they are very hard to find. The most popular alternative is the method 
of moments which usually yields readily computed estimates, but sometimes 
these estimates are not very efficient. This report looks at the efficiency 
of the method of moments and of some similarly constructed *quasi-method 
of moment ’estimators . 

The basic idea is to select a system of estimating equations which 
equates various statistics to their expected values. The method of moments 
does this for sample moments of order one, two, etc. We propose to consider 
also moments of order zero, minus one, and perhaps other functions. The 
examination of the consequent efficiencies may aid in the building of 
intuition so that a wiser selection of estimating functions can be made in 
new situations when they appear. 

Moments of order zero and minus one require positive data. The former 

is the geometric mean and the latter is the harmonic mean. They form part of 

a general family, f(r) = [*— ^ of nondecreasing means (where x^,...,x^ 

forms the data). When dealing with the harmonic mean, we will be setting 
1 r 1 -1 

— ) — (= [f(-l)] ) equal to E(l/X) because, for the parent populations 

n ^ x^ 

treated here, the latter value is easy to obtain. Similarly when dealing 
with the geometric mean, we will be setting — In x^ (= In f(0)) equal to 
E(ln X) . In this form, the geometric mean appears in many maximum likeli- 
hood systems. This suggests that alternative quantities, that are close to 
means with r = 0, might be profitably exploited. The results give some 
support to this idea. 

The structure of the efficiency computations utilizes the theoretical 
work presented in a companion report [12]. The pertinent material is 
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summarized in Section II of the present report. Section III contains 
applications of the idea to two, one-parameter parent populations, Poisson 
and symmetric beta. Some speculations for interpreting these results are 
made. Section IV is devoted to two parameter settings; gamma, negative 
binomial, and beta. 

Much computational work is necessary to support this development and 
the details are relegated to the appendices. Appendix A contains general 
relationships among the required population moments. Specialization to 
Poisson, gamma, negative binomial, and beta is contained in Appendices 
B, C, D, E (resp.). Computations are performed by APL programs and these 
are included too, as Appendix F. 



II. Methodology 

It is assumed that the estimating equations have the form 



g(x,e) = 0 



( 2 . 1 ) 



where x = (x^,...,x^) is the data vector of a random sample from the specifie 



parent population, 0^ = (9 ,...,9 ) is the parameter point which belongs to 



' P' 

an open subset of p^imensional space, and g* = (g-,...,g ). Primes denote 



transpose. In order for the system (2.1) to have a unique solution 9, it 
is necessary (by the Implicit Function Theorem [16]) that the Jacobian 



J = 



* * * > gp 



89, , ... ,9 
1 P 



f 0 . 



The following structural assumptions are made: 



(i) Each gj(x,9) for j = l,...,p is a symmetric function of x, i.e. 



is invariant under permutations of x^^,...,x^. 
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(ii) 



(iii) 

(iv) 



(v) 



E{gj(X,e)} = 0 for j = 

Var{g^(X,e)} = ^ C^(e) + o(^) for j=l,...,p. 




respect to for j = 

0, which is the solution of (2.1) is consistent and asymptotically 
nonnal. 



Assumption (i) is modest and expected of any reasonable estimating 
scheme. The meeting of assumptions (ii) and (iii) is a matter of scaling 
and arrangement. Assumption (iv) is needed so that the asymptotic covariance 
matrix is well behaved. Assumption (v) is always desirable and convenient 
since it implies that the estimators are asymptotically unbiased and the 
ellipsoid of concentration [4] is characterized in terms of the inverse of 
the asymptotic covariance matrix. Efficiency com.putations are based on its 
determinant. 

Equipment for verifying (v) is contained in [9]. There, the functions 

1 

g^ are averages of the form — J “ l,...,p and this 

structure is consonant with the present set of assumptions. All of the 

cases treated in this report have this structure. 

Much license is taken in what follows. The purist is referred to [9]. 

Let A(x,0) be the p by p matrix of partial derivatives {3g./30, }. 

J ^ 

Assume that the elements behave as in (2.2) 

A^j^(X,e) -> EOg^/36j^} (2.2) 

as n The resulting limiting matrix is denoted by A or A(0) . 

The assumptions allow the first order expansion 
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g(x,9) = g(x,e) + A(x,0 + p(0-9))(e-0) 



(2.3) 



where p is a diagonal matrix of random numbers belonging to the interval 
[0,1]. Since the system is soluble, g(x,6) = 0 and we can write 

(0-0) = -a“^(x, 0 + p(0-0) g(x,0)) (2.4) 

since the continuity of A implies that of A ^ and of g. Letting the 
asymptotic covariance matrices, as n -> », be defined by 

M = limit nE(0-0) (0-0) ’ (2.5) 

C = limit nE{g(X,0) g’(X,0)} (2.6) 

it follows from (2.2) and (2.4) that 

M = a"^ C(A')"^ (2.7) 

The method of maximum likelihood fits into this setting. The parent 
population has density f(x,6), and (2.1) takes the form 
. . n 9^.n f (x . , 0) 

— S = ~ I — = 0 for r = l,...,p (2.8) 

n r n d0 

1=1 r 

Assumption (ii) requires the regularity conditions [10] as does the relation- 
ship 

■ "''rk «.9) 

' r k ' 

where A is the information matrix. Using (2.2) and (2.6) it is seen that 
both A and C are equal to A and hence 

M = (2.10) 

follows from (2.7). 
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Now suppose that only the first q likelihood equations ( 2 . 8 ) are 



used in the system g = 0 . Let us denote this subset by p = 0 , and the 
remaining p-q equations by h = 0 . Thus in partitioned form ( 2 . 1 ) becomes 

g = = 0 (2.11) 

h 

Proceeding formally, ( 2 . 6 ) becomes 



j E(py’) 


E(;ih') ) 


1 


1 ‘=11 


"l2 


C = limit n \ 


( 




( E(h'p) 


E(hh') ) 




1 — 1 
CM 
U 


*"22 



The information matrix can be partitioned likev/ise 





- 


^1 


CM 
t — f 
< 


( 2 . 13 ) 




I 


A21 


^'22 ) 






Further, define a p-q by 


q matrix 






= {EOh^/ 30 j,)}, 


j = 


l,...,p q, k l,...,q 


( 2 . 14 ) 


and a 


p-q order square matrix 










§22 = {E( 9 h 730 ^)}, 


j = 


1, . . . ,p-q; k = q+1, . . . ,p 


( 2 . 15 ) 


It is 


shown in [ 12 , Sec. IV], that 

1 


\i 


-®h ) 








-§21 


1 

C22 ( 


( 2 . 16 ) 


where 


C29 is as (2.12) , 

( 


-'-11 


-'12 ) 






'“I 


a 

®21 


1 

022 ' 


( 2 . 17 ) 
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and 



M 



-1 



All ^12 



Azi H 



(2.18) 



where 



^ ^ 21 *^ 11^12 ^ 21 ^ 12^22 ^^ 21 *^ 12 ^ 22 ^ ' '*’ ^ 22 *^ 22^22 



Si ^^11 ~ ^12^22^21^ 



S 2 ^ll®h^22 



(2.19) 



G 22 {^22 ~ ^21^11^h^ 



Because of (2.7), the determinant of (2.18) is 



M ^1 = |aS/|c| 



( 2 . 20 ) 



and it is useful to draw attention to its computation. Using partitioned 
form, see [6, p. 165], its ingredients have determinants 



C = A 



A = A 



111 


1^22 


^ 21 ^ 11^12 


111 


§22 


° 2 l ''^ 11^12 



( 2 . 21 ) 



and this is useful in computing the asymptotic efficiency, [12] 

Eff(0) = |m“^|/|a| 

In the special case of p = 2 and q = 1, (2.19) reduces to 



( 2 . 22 ) 
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^ |c| ^^12^22 ~ ^^12^21^22 ^11®22^ 



(2.23) 



and the determinant of M ^ reduces to the especially convenient form 



M 



^^11^22 ~ ^12^22^ 
^^ 1^22 ■ ®21 



(2.24) 



and the denomination of (2,24) is C . 



III. Single Parameter Settings 
Poisson . 

The Poisson random variable X has density 

f(x;A) = e ^A^/xI for x = 0,1,... (3.1) 

and the derivative of the log likelihood is 

S = f -1 (3.2) 

It is well-known that 

A = E(S^) = y (3.3) 

and the sample mean is the minimum variance unbiased estimator for all sample 
sizes. It is also the maximum likelihood estimate and the method of moments 
estimate. 

Let us look further solely for academic purposes. Since the Poisson 

has the property that its mean is equal to its variance it follows that the 

2 

sample variance s is also a "moment” estimate of A. Moreover, the idea 
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can be extended. There are many other statistics that can be equated to 

their expected values and the resulting equations solved uniquely for X. 

One other will be considered here, namely, the averages of reciprocals of 

the {1 + ^ convenient device for avoiding 

division by zero. The result will be called the harmonic mean estimator. 

2 

Since s is directly an estimate of X, its asymptotic efficiency 
is quickly and easily expressed. Using (2.22), (3.3), and (B.5) we have 

Eff(s^) y (3.4) 

X + 2X 

The harmonic mean estimator is characterized by the equation (see (B.7)) 



where 



y--^(l-e^)=0 



(3.5) 



1 ^ 1 
= - y — — 

n , 1 + X. 



(3.6) 



Equation (3.5) is in the form g = 0 (i.e. (2.1)) and satisfies the assump- 

* 

tions . The resulting estimate, X , can be computed using the iteration 
function that falls naturally out of (3.5), namely. 




(3.7) 



and ^ for all initializations X^ > 0, but convergence can be 

quite slow if a poor X^ is chosen. Then asymptotic variance C (see 
(2.6)) of (3.5) is the variance of (1 + X) ^ which can be computed from 
(B.7) and (B.8). The quantity A of (2.2) is obtained by differentiating 
(3.5) with respect to X, 
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(3.8) 



A = ^ (1 - e - Ae . 

X 

* 2 , 

Using (2.22), (2.7) we have Eff(A ) = A A/C, or more explicitly 

Eff(X*) = ^ (1 - e"^ - Xe”^)^/Var(j^) (3.9) 

A 

The efficiences of the two estimators are compared in Figure 3.1. 

Of course there is no point in using any estimator other than A = x, but 
the graph suggests that the harmonic mean may be profitably used in other 
problems in which a choice must be exercised. 



Figure 3.1 

Asymptotic Efficiency 
{ Poisson ) 




X 
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Just for fun, let us compare the values of the three estimators x, 



s , X when applied to some famous data. First, the number of deaths due 
to mule kick in 200 Prussian army corps years. The frequency counts are 
109, 65, 22, 3, 1 for variable values 0 thru 4 [7, p. 109]. The estimators 
are 

X = 0.61 , = .6109 , A* = .6093 

and agreement is rather good. Second (Rutherford Chadwick data), the 
number of radioactive disintegration in 2608 time intervals of 7.5 seconds 
each [4, p. 150]. This time we have 

X = 3.867 , = 3.633 , X* = 3.886 . 

The sample size is much larger but the value of X is in a range of lower 

2 

efficiency for s . Also, the radioactive decay is more properly modeled 

2 

with a pure death process and this may help explain the smaller value of s . 

One final comment. The convergence of the iteration function (3.7) 
has importance in finding the maximum likelihood estimate for X from a 
Poisson population in which the zero values have been truncated. That is, 
a trial that produces no counts does not come to the experimenter's 
attention [10, p. 3-13]. The density is 

-X X 

f (x,A) = — ^ — — — r , X = 1,2, . . . 

1 - e 

and the maximum likelihood equation is 




which, as a function of X , is the same as (3.5). 



10 



Symmetric Beta 



A Beta random variable with equal parameter values has density 



f(x,a) = ^ ^ ^ ^ x°^(l-x)“ for 0£x<_l, 0<a 



[r(a)]‘ 



(3.10) 



Using the psi function [1], 



♦(«) - 



one can write the derivative of the log density as 



(3.11) 



and 



= 2ip(2a) - 2\p(a) + ln(x(l-x)) 



A = 2<p’ (a) - A<p’(2a) 



by (2.9). Let In x = ^ ^i=l express the maximum likelihood 

equation as 



(3.12) 



(3.13) 



i|(a) - ^K2a) = Y {In x + In(l-x)} 



(3.14) 



so Cl, the maximum likelihood estimate, is a function of the geometric 
mean of x and 1-x. Also, it is difficult to find. 

Let us turn to the method of moments. Because of symmetry, see (E.ll), 
(E.12), 



M 2 ’ 



2 1 
o = 



4(2a + 1) 



Thus X cannot be used and we turn to the second moment. The form g = 0 
becomes 

1 



2 

s - 



4(2a + 1) 



= 0 



(3.15) 



and the estimator can be found explicitly 
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a 



(3.16) 




_1 

2 



and using (2.2), (2.7) produces 

n Var(a) = 4(2a + 1)^ Var(s^) 



The right side of (3.17) requires that (E.2) be used with (A. 
is not compact and is produced only by computer program. 

A harmonic mean option is available since 



a - 1 



(see (E.13)) for a > 1, and the variances are finite if a > 
calculations, exploiting (E.14) and (E.15) , show that both y 
where 



1 r 1 

y = n A — . 



n X, 
1 1 



n 



= - y 

n ( 1-x. 



should be used instead of either one alone in order to reduce 
We select the form g = 0 to be 

(a - l)(y + z) - (4a - 2) = 0 



and solve explicitly 



Application of (2.2), 



n Var(a ) 



(2.7), 



a 



* ^ y + z - 2 
y + z - 4 

(E.14), and (E.15) produces 



(2a - l)(g - 1)^ 
a - 2 



for a > 2 



(3.17) 

2) . The result 

(3.18) 

2 . Easy 
and z , 

(3.19) 

variability. 

(3.20) 

( 3 . 21 ) 

(3.22) 
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Figure 3. 2 

Asymptotic Efficiency 
( Symmetric Beta ) 




The efficiencies of these two estimates are compared in Figure 3.2. 

The second moment estimator, a, is uniformly better than the harmonic mean 

•k 

estimator, a , and this result needs interpretation in the light of the 
success in the Poisson case. This time the averages of X ^ and (1-X) ^ 
were used instead of (1+X) the latter being difficult to manage with 
the beta distribution, and the parameter must be at least two in order 
for the variance to exist. Also the positive sample space is 0 < x < 1, 
which entails large and variable values for the {x^^}, and this may explain 

k 

the lack of success. The estimator a may still have some uses because 
(unlike Oi) we have a simple explicit expression for its variance (3.22) 
and its efficiency is competitive for a more than (say) four or five. 
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IV . Two Parameter Settings 



Gamma 

The gamma random variable X has density 

f(x;a,3) = — (4.1) 

r (a)3“ 



and the partial derivatives of its logarithm are 



c X o_ 

■ ,2 - 8 

= In X - In S - ip (a) 



(4.2) 



where jp(a) is the psi function (derivative of log gamma). The maximum 

likelihood estimators (a, 3) is found from (4.2) by replacing x with x 
1 r n 

and In x with In x = — In x^. Thus it fits into our generalized 
moment scheme utilizing the arithmetic and geometric means. To solve the 
system, one sets 3 = x/a into the second member of (4.2) and search for 
the root of In a - i|;(a) = In x - In x, which requires a psi function 
capability [1,3]. This is not difficult with a large computer, but may 
be a challenge for a small one, e.g. a hand held calculator. Viable 
alternatives are available using the ordinary method of moments and a 
generalization that exploits the harmonic mean. 

The information matrix (2.9) is 

/ a/3^ 1/3 I 

A = (4.3) 

(1/3 if;' (a) ) 

whose inverse is 
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I 



A -1 = 1 ) 

at|) ’ (a )-l I _ ^ 

where primes denote derivative. 



(a) 



- 6 



(4.4) 



- 2 2 

The ordinary method of moments equates x and s to aS and a3 , 

respectively, and the resulting estimator is 



2 /- 

= s /x , 



~ _ - 2 / 2 
a = X /s 



(4.5) 



Using 9^=3 and 9 ^ = ot in order to conform with (A. 2), one can apply 
(2.2) to obtain 



A = 



( » 6 I 



2aB 3^ 



(4.6) 



Note: Although the method of moments shares an equation with maximum likeli- 

_ 2 

hood (x/6 “ ot/B = 0), there is no advantage to using this, through (2.18), 

in this case. 

The matrix C is obtained from (C.5), (C.6), and (C.7). Then (2.7) 
can be applied to get 

^ 2g + 3 
a 2(g+l) 

-6 

and 

|m 1 = 2(a+l)6^ (4.8) 




M = 2(a+l) 



Turning to another choice, let us recognize that the ordinary method 
of moments uses moments of order one and two, while maximum likelihood 
uses moments of order one and zero. Heuris tically one might find advantage 
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in moments of order one and minus one. Consider for the system (2.2) 
(see (C.2) and (C.8)), 



X - aB =0 
1 



(4.9) 



e (a -1) 



= 0 



where ^ 1 1 1/x^ and the consequent estimators 



* - 1 
B = X - - 
y 



* 

a = 






(4.10) 



xy - 1 



:k 

which satisfies B > 0 and a > 0 since the arithmetic mean is larger 
than the harmonic mean. Proceeding as before we obtain, for a > 2, 



-a 



-6 



and 



and 



* 

A = 



B^(a-l) 6 (a 



.2 2a - 3 



* ^ 2(g-l); 

a - 2 



|M*| = 2B^ 

’ ' a - 2 



2(a-l)‘ 



-B 



- 1)^1 

1 

■i 



for a > 2 



(4.11) 



(4.12) 



(4.13) 



The asymptotic efficiencies of the estimators (4.5) and (4.10) are 
computed from (2.22) using 



|a| = -^ (at|j' (a) - 1) 
B 



(4.14) 



These efficiencies do not depend on the scale parameter, B, and can be 
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plotted as functions of a, as is done in Figure 4.1. Also, the former 
appears in [15]. The alternative (a ,3 ) catches up to (a, 3) at a = 3 

and remains better thereafter. The relative efficiency of the latter with 
respect to the former is (see (4.13) and (4.8)) 



|M*| ^ (g-1)^ 

|m| (a-2)(a+l) 



(4.15) 



It is less than 100% for all a > 3 and falls below 90% for 4 < a < 7. 



Figure 4.1 

Asymptotic Efficiency 




FIGURE 4.1. Asymptotic Efficiency (Gamma) 
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As a numerical example, the three estimators were applied to 135 
observations on the service time of the check-in queue at a major hotel. 
The gamma distribution fits quite acceptably according to chi square 
criteria. The results are: 



6 = .303 6 = .305 

a = 6.33 a = 6.31 



= .298 

a* = 6.45 



Counterexample : This opportunity is taken to show that the representation 

(2.18), which is the main theorem of [12] , is false when one drops the 
assumption that the subsystems p = 0 (2.11) are likelihood equations. 

We use the system 

2 2 
s - aB = 0 



which has an equation in common with the method of moments (4.5), but this 
common equation is not part of the maximum likelihood system. This system 
has explicit solutions 



B = 



2 ( y 






1 ’ 



1 

a = 1 + -4; 

y6 



One readily develops, using (4.11) and (4.6) 



A = 



S (a-1) 
2a6 



3(a-l)' 

,2 



B 



and from (C.7), (C.9), (C.ll) 
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(a-2) 



c = 



2a6 (a+3) 



Using M ^ = A’C ^A, we calculate 



1 _ ^ 2a 



6^(a-l)^(a-2) 6^(a+3) 



M 



*-l 



a-2 1 



3(a-l) 6(a+3) 



a-2 + 1 



6(a-l) 6(a+3) 



a-2 1 



(a-1) 



2 2a(a+3) 



and this, clearly, has no submatrix in common with (upon inverting (4.7)) 




“ \ 

^ 2a + 3 ( 
a 2(a+l) ^ 



Negative Binomial 

The negative binomial density is parametrized 



f(x;r,p) = for x = 0,1,2,... (4.16) 

0<r,0<p<l, p+q = 1 

and the partial derivatives of its logarithm are 



Sp = r/p - x/q 

Sj. = iKr+x) - ii)(r) + In p 



(4.17) 



Using the basic recursive formula for the psi function [1] , 
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X 

,jj(r+x) - ,|,(r) = I — 

j=l 



1 

1 + j 



one may express the system of maximum likelihood equations as 



X - rq/p = 0 



ave 
i=l , • . , 



X. 

1 



(4.18) 



j = l 



r - 1 + j 



+ In p = 0 



Solving (4.18) is quite difficult to manage without a large computer. 
Appendix F contains an APL program (PSIB) which accomplishes it. 

The information matrix can be developed using (2.9). The result is, 
using 0^ = p and = r. 



j r/qp^ -1/p I 

I -1/P ^22 f 



(4.19) 



where 



= t|^'(r) - E(ij^’(r+X)) 



(4.20) 



The properties of are developed in Appendix D (see (D.22) and (D.26)), 

along with the series representation of the determinant (D.25), 



A| 



oo 




p n=l 



n 



n + 1 



r . n ^ 



(n+r) I 



(4.21) 



which converges rapidly for most values of p. For purposes of computation, 
one may as well use (4.21) in (4.19) and solve for ^22^ thus 



A 



22 




+ P 



A } 



(4.23) 
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To estimate by the ordinary method of moments, we use (D.5) and form 



the system (2.1) to be 



px ~ rq = 0 
2 2 

p s - rq = 0 



(4.24) 



The explicit solution is readily seen to be 



p = — , 



r = 



-2 

X 



2 

s - X 



(4.25) 



and it cannot be guaranteed that p < 1 and r > 0. Differentiating (4 
and taking expectations yields the matrix 



A = 



r/q 

r(l+q) 

P 






(4.26) 



Combining (2.6), (4.24) and (D.6) produces 



C = rq 



1 1 + q I 

1 + q 1 + 2(r+2)q + q^ 



(4.27) 



and finally the formula (2.7) 



M = 



2(r+l) 

2 

rq 






pq 



pq 






(4.28) 



and 



;M| = 2(r+l)p“/q 



(4.29) 



.24) 
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Let us turn to the question of using a different moment in the second 
equation. Since the negative binomial has positive probability mass at 
zero, we use averages of the (1+x^) ^ as was done when the Poisson case 
was considered. Perhaps that level of success can be matched. 

Letting ^ ^ ^1 system (2.1) 



px - rq = 0 
(r-l)qy - p + p^ = 0 



(4.30) 



because of (D.7). The system (4.30) cannot be solved explicitly, but it 
can be managed with a hand held calculator. Use the first member to obtain r 
as a function of p and its derivative 



xp 

r = — 

q 



dp 



(4.31) 



and substitute into the second member of (4.30) to obtain a function f 
of the form 

f(p) = + (r-l)qy - p 



= p - y + p (y - 1 + xy) 



(4.32) 



and having derivatives 

r— 

df , - N , P x(q + In p) 

— = (y - 1 + xy) + ^ 

q 

The solution of f(p) = 0 can be obtained by Newton ^s method, always 
remembering to update r as well as p. The initialization p = .5 and 
r = X appears to be satisfactory, but normally convergence would be faster 
if the moment estimators (4.25) are used. 
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The resulting estimator will be denoted by p , r . From (4.32) it 
is seen that 

f(0) = -y < 0 , f(l) = ^ > 0 

"k k 

SO that 0 < p <1, and r >0 follows from this by (4.31). 

Let us turn to the development of the asymptotic covariance structure 

"k 

of p , r . Taking partial derivatives and the expectation of (4.32) to 
produce the coefficient matrix A of (2.2) yields 



k 

A 




r/p 




-q 




In p 



] 

( 



(4.33) 



with the help of (D.7). Attention is drawn to the fact that 



r 

limit ^ ^ ^ = p - p In p 

r _ 1 

r ^ 1 



(4.34) 



The covariance matrix C of (2.6) is derived from (4.30) with the help of 
(D.5), (D.7), and (A. 5). The result is 



) " 

irqp^ - pd-p’^) 



rqp’^ - pCl-p’^) 
(r-l)^q^ Var(Tfl^) 



(4.34) 



Let us draw attention to the fact the first equation of (4.17) is a multiple 
of the first equation of (4.32) rather than being identical. This is the 
reason that (4.33) and (4.35) are modifications of (2.17) and (2.16) rather 
than exact analogies. Thus the bookkeeping that follows must be done 
carefully. 
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The direct development of the matrix M from (2.7) with (4.34) and 
(4.35) as input is messy. Instead let us recognize that (4.30) shares an 



equation 


with the likelihood 


system (4.18) 


and 


given by 


(4 . 19) , we have 


( r/qp^ 


-1/p 




*-l 


1 






M 


= / 








1 -1/p 


m2 2 

M 
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where M is obtained from (2.23) and 



(4.36) 



from (2.16). Thus 



M 



22 



k- 2 2 ^ ^22 ) 

Cl ) 2 ®22 p °21®22 2 

' (qp P ) 



r p _ 2 

2 22 ®21 

qp 



(4.37) 



and (§21’ ^22^ second row of A in (4.33) and is taken from 

(4.35). Using (2.24) we obtain 



and (4.36) 

|c| . p (r-l)\ Var(^) - 

The asymptotic efficiencies of (p,r) and (p ,r ) appear in Tables 

4.1 and 4.2, resp. , for p = .1(.1).9 and r = .5(.5)5, 6(1)19 where the 

parentheses indicate the indices of advancement. The efficiency of (p,r) 

is monotone increasing in r for each p. It is lower for the smaller 

^ * 

values of p. The efficiency of (p ,r ) is high for the smaller values 
of r and decreases (generally). It is not monotone for p = .1, .2. 
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The relative efficiency of p , r with respect to p, r, i.e., 

Rel eff = 1^1 

* * 

appears in Table 4.3. Generally p , r is preferable for r less than 
or equal to (say) 2.5. In general p, r is preferable elsewhere, but it 
does not matter much for small values of p. 

The three estimation schemes were applied to the Cricket score data 
of Keep, Pollard and Benjamin [13], which provided the following comparisons. 





Cowdry 


Barrington 


Graveney 


X 


1.692 


2.095 


1.570 


2 

s 


4.343 


4.939 


4.474 


(l+x)'^ 


.603 


.538 


.626 


n 


156. 


116. 


107. 


P 


.329 


.345 


.317 


r 


.831 


1.014 


.729 


P 


.390 


.424 


.351 


r 


1.080 


1.543 


.849 


•k 

P 


.371 


.326 


.389 


k 

r 


1.000 


1.012 


1.000 
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TABLE 4.2. ASYMPTOTIC EFFICIENCY OF 
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TABLE 4.3. RELATIVE EFFICIENCY OF p , r WITH RESPECT TO 
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I 



Beta 



The beta density has the form 



_ r (a+6) .6-1 

£(k«, 6) (1-x) 



(4.37) 



for 0 < X < 1, 0< a, 0 < 6 



and the partial derivatives of its logarithm are 



= i|;(a+6) 4f(a) + In x 

S = i(j(a+6) ~ ip(6) + In(l-x) 

P 

The information matrix is, for ~ ^2 " ^ 

j ip’(a) - ip*(a+6) -il)*(Orf6) | 

( - iKoH-B) - if-'(cM-e) ) 



(4.38) 



(4.39) 



The system of maximum likelihood equations 

In X = ( a) - ( of 6) 

In (1-x) = i|)(6) - i(i(a+6) 



(4.40) 



uses the geometric means of x and 1-x, and is difficult to solve. \{hat 
other pairs of statistics might be substituted? 

Clearly x and (1-x) cannot be paired since they are functionally 
related. The latter is merely 1-x. Let us first develop the ordinary 
method of moments. Using (E.4) and (E.5), choose the systems 

(a+6)x - a = 0 



(4.41) 



(a+6)^ (a+6+l)s^ - ag = 0 



and solve explicitly for 
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s - x(l-x) 
a = 

1-x 



s - x(l-x) 

X 



(4.42) 



It may occur that a < 0, 3 < 0. 

The coefficient matrix A of (2.2) is 



-6 



A = 



a+3 



e 



(4.43) 






a+3+1 



A = 



-aB 

a + 3 + 1 



(4.44) 



and C of (2.6) takes the form 



a3 



a+3+1 



C = 



(a+8) ^ (a+3+1) Cov(x,s^) 



(a+3) ^(a+B+1) Cov(x,s^) 
(a+3) ^ (a+3+1) ^ var(s^) 



1 

) 



(4.45) 



and the use of (E.2) thru (A.l) and (A. 2) does not appear to simplify in 

2 

any useful way. Appendix F contains programs to compute |m| = |c|/|a| . 
Let us turn to the pair of statistics 



y 



1 “ 1 
-I - 

n^Xi 



z 




1 



1-x. 

1 



(4.46) 



which are not functionally related. Using (E.8) we may choose the system 



(a-l)y - (a+B-1) = 0 
(8-1) z - (a+B-1) = 0 



(4.47) 



for a > 1, 3 > 1. The solution is 
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y( 2 -i) 
yz - y - z 



z(y-l) 
yz - y - z 



(A. 48) 



* 

a 



* 

6 = 



Clearly y > 1 and z > 1 but the denominators are not necessarily positive 
since zy-y-z+l= (y-1) (z-1) can be less than one. 

For the system (4,47) the coefficient matrix becomes 



A 




-1 

g 

B-1 



(4.49) 



and with the help of (E-9) and (E.IO) one can calculate 




(4.50) 



for a > 2, 3 > 2. Use of (4.49) and (4.50) in (2.7) does not produce a con- 
venient expression for M . However its determinant is easily managed. 

For fun, let us also try a system based on moments of order one and 
minus one. (This would seem to straddle the two geometric means appearing 
in maximum likelihood.) We are lead to 



(g+6)x - g = 0 
(g-l)y - (g+6-1) = 0 



(4.51) 



and the resulting estimate 



(y-l)x 

yx-1 



g* ^ (y-1) (1-x) 
yx-1 



(4.52) 
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and this satisfies 



a > 0, 3 >0. (But do not forget that use of (E.6) 



in (4.51) requires a > 1.) Proceeding in the usual way, we calculate 



A = 



a + 6 
B 

a - 1 



g 

g + B 
-1 



(4.53) 



i gB 
g + B + 1 

-B 



! 

B (g-PB+1) \ 
g - 2 / 



4.54) 



Again M does not have a convenient form. The determinates of (4.52) and 
(4.53) are expressed 



A| 



-S 

(g+B) (a-1) 



|c 




(4.55) 



for g > 2. 

The efficiencies of (4.42), (4.48), and (4.52) are compared in the 
tables that follow. Table 4.4 contains the efficiency of the ordinary moment 
estimator. Efficiency is high if a is not too far from B and both are 
at least two. Elsewhere they are low, but still the choice because all the 
numbers in Table 4.4 are better than their competitors in Tables 4.5 and 4.6. 
The pair (g ,B ) is generally better than (g ,B ) but not uniformly so. 

It matters little since (g,B) is the ”hands down" winner. This result 
parallels what was learned in the symmetric beta case. The variable X ^ 
is unstable in this population. 
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TABLE 4.4. Eff(a,6) 



BETA POPULATION 



.5 

.5 0-. 4 93 

1.0 0.554 

1.5 0. 53 7 

2.0 0.507 

2.5 0.477 

3.0 0.451 

3.5 0.429 

4.0 0.411 

4.5 0. 395 

5.0 0. 382 

5.5 0.370 

6.0 0.360 

6.5 0.351 

7.0 0.344 



1.0 1.5 

0.554 0.537 

0.713 0.747 

0.747 0.820 

0.739 0.839 

0.719 0.835 

0.696 0.822 

0.674 0.806 

0.653 0.790 

0. 63 5 0.774 

0. 61 8 0.758 

0.604 0.744 

0.591 0.731 

0.579 0.719 

0.568 0.709 



2.0 2.5 

0.507 0.477 

0.739 0.719 

0.839 0.835 

0.878 0.889 

0.889 0.912 

0.887 0 .919 

0.878 0.91 8 

0.867 0.912 

0.855 0.904 

0.843 0.895 

0.831 0.886 

0.81 9 0. 8 76 

0.809 0.867 

0.799 0.858 



3.0 3.5 

0.451 0.ii29 
0 . 696 0.674 
0.822 0.806 
0.887 0.878 
0.919 0.918 
0.934 0.938 
0.938 0.948 
0.938 0.952 
0.933 0.951 
0.928 0.948 
0.921 0.944 
0.914 0.938 
0.906 0.933 
0.899 0.927 



4.0 

.5 O.'+ll 
l!o 0.653 

1.5 0.790 

2.0 0.867 

2.5 0.912 

3.0 0.93 8 

3.5 0.952 

4.0 0. 959 

4.5 0.961 

5.0 0.961 

5.5 0 . 958 

6.0 0.955 

6.5 0.951 

7.0 0.946 



4.5 5.0 

0.395 0.3 82 

0. 63 5 0.61 8 

0.774 0.758 

0.855 0.843 

0.904 0. 895 

0.933 0.928 

0. 951 0.948 

0.961 0.961 

C.966 0.968 

0.966 0.972 

0.96S 0.973 

0.966 0,973 

0. 963 0,972 

0. 960 0.969 



5.5 


6.0 


0*370 


0. 360 


0 .604 


0.591 


0 .744 


0.731 


0.831 


0.819 


0.886 


0.876 


0. 921 


0.914 


0 . 944 


0. 93 8 


0.95 8 


0. 955 


0. 968 


0.966 


0. 973 


0. 973 


0. 976 


0. 977 


0.977 


0.980 


0. 977 


0. 980 


0 . 976 


0.980 



6.5 7.0 

0.351 0.344 

0. 579 0.568 

0. 719 0.709 

0.809 0.7 99 

0.867 0.858 

0. 906 0.899 

0. 933 0.927 

0. 951 0. 946 

0.963 0.960 

0.972 0.969 

0. 977 0.976 

0. 980 0. 980 

0.982 0.983 

0. 983 0. 984 
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TABLE 4.6. Eff(a*,s''') 



BETA POPULATION 





.5 


1.0 


1.5 


o 

CM 


2.5 


3.0 


3.5 


2.5 


0.119 


0. 202 


0.258 


0 .297 


0.326 


0.347 


0.364 


3.0 


0 .1 63 


0 . 276 


0. 353 


0.407 


0.447 


0.477 


0.500 


3.5 


0.184 


0.313 


0 . 400 


0 .461 


0.506 


0.540 


0.566 


4.0 


0.196 


0. 33 3 


0.426 


0 .492 


0.5 40 


0.576 


0.605 


4.5 


0.204 


0.346 


0. U43 


0.511 


0.561 


0.599 


0.629 


5.0 


0 . 209 


0. 355 


0.454 


0.524 


0.576 


0 . 61 5 


0.645 


5.5 


0. 212 


0. 361 


0 . 462 


0.534 


0.566 


0 . 626 


0.657 


6.0 


0 . 21 5 


0. 366 


0. 468 


0.5 41 


0.5 94 


0.634 


0. 666 


6.5 


0. 217 


0. 369 


0. 473 


0.546 


0. 600 


0.641 


0 .673 


7.0 


0. 21 8 


0.372 


0 . 476 


0.5 50 


0.604 


0.645 


0.678 





4.0 


4.5 


5.0 


5.5 


6.0 


6.5 


7.C 


2.5 


0.378 


0. 389 


0.398 


0.406 


0.412 


0. 41 8 


0. 423 


3.0 


0.519 


0.534 


0.546 


0.557 


0. 566 


0.5 74 


0.581 


3.5 


0.588 


0. 605 


0.619 


0 . 632 


0 . 642 


0 . 651 


0.659 


4.0 


0.627 


0 . 646 


0. 662 


0.675 


0.686 


0 . 695 


0.704 


4.5 


0. 653 


0 .672 


0.688 


0.702 


0.714 


0. 724 


0.733 


5.0 


0. 670 


0. 690 


0.707 


0. 721 


0. 733 


0.743 


0.75 2 


5.5 


0.682 


0.7 03 


0. 720 


0.734 


0. 746 


0.7 57 


0.766 


6.0 


0.691 


0.712 


0.730 


0.744 


0.757 


0.768 


0.777 


6.5 


0 .698 


0. 719 


0. 737 


0.752 


0.7 65 


0.776 


0.785 


7.0 


0.704 


0.725 


0.743 


0.758 


0.771 


0.782 


0.792 
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APPENDIX A 



General Variance and Covariance Formulae 
Connecting the Mean, Variance, Harmonic Mean 



Let X^,.. 



mean with 



,X^ be a random sample of size n and denote the sample 



n 



X = M X, , 






the sample variance with 



2 1 



n 



■ • 



the inverse of the harmonic mean with 



Y . i y i 

"1 ’ 



and the inverse of the shifted harmonic mean with 



1 ^ 1 
Y * = — y — ^ 
n ^ 1+X. • 

1 1 



- 2 2 

It is well known that E(X) = p , E(s ) = a when the population mean, p, 

2 r r 

and variance, a , exist. Let m^ = E(X ) and “ E(X-p) . The follow- 
ing relationships are needed. 



Cov(X,s^) 



— {m» 
n 3 

1 

n ^3 



3m2m^ + 2m^} 



(A.l) 
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(A. 2) 



Var(s^) - ^ {m. - 4m m + 3m^ - H ^ o^} 

n 4 3 1 2 n-1 

1 / ^ a. 2 4, 

n 4 n-1 



n 



Var(^ I (X -X)h 
1 



^ - ^2 



^^4 ^^ 2 ^ ^4 ^^2 

- 2 ■ • + — T — - 



Cov(X,Y) =- {1 - E(X) E(l/X)} < 0 
n 



Cov(X,Y') = ^ U - [1 + E(X)] E(^)} < 0 



Cov(s^,Y) = - - {m„E(l/X) + m - 2m^E(l/X)} 
n / i 1 



1 / 2 , , 
= - — lM„m - IJ m + y} 
n z ~1 -1 



Cov(s^,Y') = - { [^2 - (1+u)^] + 1 + y} 



(A. 3) 

(A. 4) 
(A. 5) 
(A. 6 ) 

(A. 7) 



The proofs of (A. 1) to (A. 7) follow. It is convenient to record 
the relationships 

^ 2 
ni^ = ^2 ^ 

1113 = ^3 + 3^2^ + (A. 8 ) 

2 4 

+ 4P3P + 6^2^ + P 

To enhance the readability, the symbols E, V, C will be used to denote 
expectation, variance, and covariance (resp.). Parentheses and subscripts 
will be used sparingly. 



37 



Proof of (A. 4). Consider 



EXY = ^ EEX.Z(1/X.) 

^ ^ 2 
n 



= ^ {n + n(n-l) EXE(1/X)} 
n 

from which we subtract EXE(1/X) to produce (A. 4). The fact that (A. 4) is 

negative follows from the presumption that X > 0 and the consequence that 

the harmonic mean is less than the arithmetic mean. Thus (ave X.) x (ave 

1 X. 

1 

unless (all = constant) and apply the law of large numbers. 

Proof of (A. 5) . Follows from the fact that C(1 + X)Y* = CXY* and the 
application of (A. 4) . 

Proof of (A. 6). Consider 



n(n-l) Es Y = 



E Z ^ Z (X -X)^ 
i ^ 



1 2-2 
= E E ^ (Lxf-nX^) 



XT X.X. 

= ESZ — 3- - - ZEE -i— 1- 
X . n X, 

1 K 



= nEX + n(n-l) EX^E ^ - EX - 2(n-l) EX 

A. 

- (n-l) EX^E ^ - (n-1) (n-2) E^XE 



and divide through by (n-l) to get 
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nEs^Y = -EX + (n-1) EX^E 



= -m^ + (n-l)m2E - 



1 2 1 

- - (n-2) E XE j 

- (n-2)mjE ^ 



2 1 

Then subtract n(m 2 -m^)E~ to obtain the first version of (A,6). The 
second version follows from (A. 8). 



2 

Proof of (A.7) > Follows from (A. 6) because s and invariant under 

trans lations. 



Proof of (A> 1) . Let us work with 



EXs^ = E{EX^EX. - - (EX.)^} 

n(n-l) 1 j n 1 



n(n-l) 



^ {nEX^ + n(n-l)EX^EX - EX^ “ 3(n-l)EX^EX- (n-1) (n-2 ) e\} 



- {EX^ + (n-3)EX^EX - (n-2)E^X) 
n 



= — {m^ + (n-3)m^m. - (n-2)m^} 
n j z 1 1 



and subtract ^2^1 ” ®1* second version follows from (A. 8) 



Proof of (A. 2) , Let us begin with 

(n-l)^Es^ = E(EX^)^ - - EExLeX.)^ + E(EX )"^ . 

1 nil z 1 

n 



and treat the three main ingredients separately. 
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(A. 9) 



E(EX^)^ = nEX^ + n(n-l)E^X^ 

EX^(EXj)^ = nEX^ + n(n-l)E^X^ + n(n-l) (n-2) EX^E^X + 2n(n-l)EX^EX (A. 10) 

E(EX^)^ = nEX^ + 4n(n-l)Ex\x + 3n(n-l)E^X^ + 6n(n-l) (n-2) EX^E^X 

+ n(n-l)(n-2)(n-3)E^X (A. 11) 

The proper combination of (A. 9), (A. 10), and (A. 11) produces 

Es^ = - {EX^ - 4EX^EX 
n 

+ ^ [(n^-2+3)E^X^-2(n-2) (n-3)EX^E^X+ (n-2) (n-3)E'^x] } 
n—1 

4 

The subtraction of a in the form 

4 2 2 2 2 4 

a = EX - 2EX E X + EX 

yields 

2 4 22 424 

nVs = EX - 4 EX EX + 3E X - 4a + a 

n-1 

and this is (A. 2). The second version follows from using (A. 8) and modifying. 

Proof of (A. 3) . The variance of this form of the sample variance can be 
developed from (A. 2) using (A. 8). It also appears in [7, p. 183]. 
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APPENDIX B 



Moments of the Poisson 
The Poisson random variable X had density 

f(x;A) = e ^ A^/xI , x = 0,1,2,... (B.l) 

2 

and it is well knol^m that y = A, a = A. The probability generating function 
is 

G(u) = E(u^) = (B.2) 



and the moment generating function can be obtained from (B.2) by the replace- 
ment e^ for u. Moments can be obtained by repeated differentiation. 

We record 



E(X) = A 
E(X^) = + A 

E(X^) = X + 3A^ + X^ 

4 2 3 3 

E(X ) = X + 7X + 6X + X 



(B.3) 



Use of (B.3) into (A. 2) produces 

= X + 3X^ (B.4) 

Var(s^) = - {2X^ + X} + o(% (B.5) 

n n 

Cov(X,s^) = ^ A (B.6) 
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The moments of (1+X) ^ can be obtained by integrating the 
generating function (B.2) 



E(y^) = E / u^du = / G(u) du = e e^^ du = “ (1-e (B.7) 



0 



0 






1 1 „ „ 

E / / u V du dv = // G(uv) du dv 

0 0 



-A . - Auv , , 

e fje du dv 



-A 1 Au - -A CO 1 j 

du . V n •!“ 

* 0 " * 1 0 



= e 



-X V 

i j'ji 



(B.8) 



This opportunity is taken to record an alternative way of obtaining 

moments, (B.2), which in this case is somewhat easier than the differentiation 

of the moment generating function. One begins with the generating function 
X 

G(u) = E(u ) and replaces the argument u by a product of dummy variable 
uv • • • z containing as many factors as the order of the moment to be calcu- 
lated. Then one takes a partial derivative with respect to each variable 

u, V, etc. and the desired moment is obtained when each variable is set to 

2 

unity. For example, E(X ) can be obtained from 



G 

uv 



3u 9v 



G(uv) 



3 xX X-1 X-1, 

3^ E(uv) = E(X u V ) 



(B.9) 



The four moments (B.3) can be obtained in this way replacing u 
with uvwz in (B.2). The resulting derivatives are 
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G 

u 



G 

uv 



G 

uvw 



G 

uvwz 



XG 

XG[1+ Xuvwz] 

2 2 

XG{z[l + Xuvwz] + Xuvwz [1 + Xuvwz] + Auvwz } 

Xuvw G + XG([1 + Xuvwz] + zXu^A^7 + 2zXuvw( 1+Xuvwz) 
uvw L L j 

+ + 2zAuvw} 
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APPENDIX C 



Moments of the Gamma 



The gamma random variable 
f(x;a,B) = 



X has density 

_1 ^a-l^-x/6 

r(a)e“ 



and it is well-known that 

2 2 

y = aS a = aS 

Direct integration produces the formula, for r > 0 



and for r < a 



E(X^) = 6 



r r (g+r) 
r(a) 



E(X = 



r(g-r) 

r(a) 



Use of (C.3) in (A.l) and (A. 2) produces 



Var(X) = - 
n 

- 2 2 3 

Cov(X,s ) = - aB 

Var(s^) = 2aB^ + ~) 

n-1 n 



(c.l) 



(C.2) 



(C.3) 



(C.4) 



(C.5) 



(C.6) 



(C.7) 



Using (C.4) one readily calculates 
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' 



6(a-l) 



1 < a 



■ . 2 / ,: 2 , 



2 < a 



6 (a-1) (a-2) 



*1 ^ 

and letting ^ (1/X^) , we find that, with the aid of (A. 4 



Cov(X,Y) = 



-1 



n(a-l) 



1 < a 



and with the aid of (A. 6) 



Cov(s ,Y) = 0 



Because of the fomula 



r'(y) = / ln(y) y“ ^ e ^ dy 
0 



one can develop, for r > - a, 



E(X^ In X) = (In 6 + ^'(a+r)) 



since f'(ci) = f (a) ip(a) , 



(C.8) 

(C.9) 

(C.IO) 

(C.ll) 

(C.12) 

(C.13) 
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APPENDIX D 



Moments of the Negative Binomial 
The density has the form 

f (x; ,r,p)_ = for x=0,l,-.. (D.l) 

0<r,0<p<l, p+q = 1 

Its probability generating function 

Y 

G(u) = E(u ) = — 2 (D.2) 

(l-qu)*^ 

will be exploited broadly. Successive derivatives of (D.2) evaluated at 
u = 1 produce the factorial moments 

= EX(X-l) ••• (X-s+1) = (q/p)® r(r+l) ••• (r+s-1) (D.3) 

to which one may apply some orderly substitution and obtain the first four 
moments, using A = q/p 

EX = Ar 

EX^ = A^r(r+1) + Ar 

(D.4) 

EX^ = A^r(r+1) (r+2) + 3A^r(r+l) + Ar 

EX^ = A^r (r+1) (r+2) (r+3) + 5A^r (r+1) (r+2) + 4A^r(r+l) + Ar 
The mean and variance of the population are 



46 



li = rq/p 



(D.5) 



2 , 2 
o = rq/p 



Use of (D.4) in (A.l) and (A. 2) provides the covariance matrix for 
the ordinary method of moments 



n Var(X) = A^r + Ar = rq/p^ = 



n Cov(X,s^) 



2A^r + 3A^r + Ar = 



n Var(s^) = 2A^r(r+3) + 4A^r(r+3) + A^r(2r+7) + Ar 
= o^[2(r+3)q+p^]/p^ 



(D.6) 



The harmonic mean alternative requires 






E(— i— ) ^ 



= P-P 



(r-l)q 



= — i r 1 i] 

V-D'l ‘o u J 



1 du 



(D.7) 



(D.8) 



for r ^ 1. The case r = 1 is treated in [12, Appendix A]. Expressions 
(D.7) and (D.8) are justified next along with computational formulae for 
(D.8) when r is either a whole number or a whole number plus 0.5. 



Proof of (D.7) . The generating function (D.2) is integrated. 



E(y^) = / G(u)du = p""/ 



du 






0 (1-qu)^ (r-1) (-q) (l-qu)‘^ ^ 



n=l r 

_ P-P 



(r-l)q 



n=0 



as required. 
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Proof of (D.8) . Replace u by uv and integrate twice. 



E(- 



1 2 1 1 1 ( r 

TZy) ~ f f G(uv) du dv = / dv {7 — q- 

1+’' i 6 II (l-quv)'-l 



U=1 

u=o! 



e 1_ 1 r 1 1 ] 

(r-l)q J uLn_. ^r-1 J 



du 



(l-qu) 



as required. 

Computational formulae require managing integrals of the type 



I = r — ^ 

^ 0 Lu(l-qu)^ 



du 



(D. 9 ) 



Clearly = 0 . It is useful to apply the partial fraction representation 
rep eatedly . 



u(l-qu)^ (1-qu)^ u(l-qu)^ ^ 



(D.IO) 



Since 



} _q_J_u_ . r_J_ . 

J .r r-1 1 r-1 M 

0 (l-qu) un -I 



we have, from (D.IO), 



r r-1 r-1 | r-1 J 



Let <r> be the greatest integer in r and apply the above formula to get 
the representation 

<r> 



I = ^ ^ - 1] + J 



<r> 



(D.ll) 



which is valid for r > <r>. 
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Let r = < r> . One can show directly from (D.9) that 



I = r q = 

1 (i-qu) 



- In p 



(D.12) 



and then 



= T -K - il 
j=i Lp^-j J 



1 1 

- In p = — - I — [p^ ^-P^] - In p (D.13) 



j=l ^ 



2 r- 

Accordingly it is recognized from (D.8) and (D.9) that E(l/l-hX) = [p / (r-l)q ] 
which, together with (D.7) and (D.13) enables 






1 r r-j_„ri _ 



_ (P-P^)^ ) 



-p^] - p^ In p - j (D.14) 



for r an integer ^ 2 (empty sum is zero), and for r = 1, 






(D.15) 



j-1 



This latter formula is developed in [12], see (A. 3) and (A. 5). 



Let r = < r> + . 5 . The exploitation of (D.ll) requires dealing with 



‘^r-<r> ^1/2 



= /'r __A__ 1 1 

0 L ^ J 



- du . 



Making the change qu = sin"" Q provides for manageable integrals of 
trigonometric functions. See [14, p. 316]. The result is 



Jl /2 2 In 



1 + V P 



(D.16) 
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and 






(D.17) 



which is valid for <r> ^ 1. For r = 1/2 one needs, from (D.9), 




= 2v^ - 2 + J 



1/2 



using formula [14, p.316]. From (D.8) 



(D.18) 



^^1+X^ ^ ■ f - 2 + = “1’^- P- In ^7^1 

for r = 1/2. 

The information matrix (4.19) contains a difficult element ^ 22 ’ 
(4 •20), It may be managed using an integral representation of the tri- 
gamma function [1, p. 259], 



1 

’l'’(r) = - / 

0 



In u 
1-u 



r-1 

u 



du 



(D.20) 



It follows that 



= ,j;’(r) - E{^’(r+X)} 

7 In u r r-1 , r-l+X. , 

= - J YZ~ tn - E(u )] du 

0 

• - fl r 1 



du 



(D.21) 
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This relationship can be expressed better if we make the change of variable 



w = pu/ (1-qu) 



and manipulate to obtain 



f ln(p+qw) r-1 

22 = -qJ 1-w " 



dw 



(D.22) 



There is advantage in using (D.22) in the development of the determinant of 
A. (4.19), 

1 



A 



r r ln(p+qw) r-1 , 1 

X I — 7 *^— — w dw r 

2 J 1 - w 2 

qp 0 p 



1 (. 1 J . 1) 

2 q 1-w 



In(p-Ki iHl) 

2 n 1-W 

qp 0 



(D.23) 



using partial integration and 



ln(p+qw) 

1-w 



oo n 



= ^(l-w) 



n-1 



In this form it is easily checked that 

1 

|A| = ^ J 

r ^ 0 



limit |A| - dw- 

qp 0 ^ 



-ln(p) ^0, 

2 

qp 



limit A = 0 



(D.24) 



For computational purposes one can exploit the expansion 
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. I si 

1 - w n 

n=z 



-2 



dw 



which, when used in (D.23), produces 



A = 



.n-2 



1 ^ nni(* ^/n 

— o I q J w (i-w) dw 

qp 2 " 0 



W 

_ 1 Y 

2 J ‘I 

P 1 



n n r I (n-1) .* 
'^■*‘1 (n+r) ! 



(D.25) 



I 



n f I 
] r . n I 



2 ^ n+1 (n+r) I 

p n=l 



Similar efforts applied to (D,22) can produce 



oo n 

^ y (r-1).’ n; 

22 2 (r+n-l)I 

n=l n 



(D.26) 
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APPENDIX E 



Moments of the Beta Distribution 



The Beta random variable X has density 



fix'd 6) = f t. ^ 

£U,ot S) X (1 x; 



for 0<x<l, 0<a, 0< 



(E.l) 



and 



r (a) = j x°* ^ dx 

0 



This is called the B(a,g) distribution and it is useful to note that 1-X 
has a B(6,a) distribution. 

Moments are obtained directly by manipulating gamma and beta functions. 
For r > 0 



Hex’") 



r (a-hB ) r (ct+r) _ (g+B-1).' (g+r-1) .' 
r(a) r(a 4 B+r) (a-1) ! (a+S+ r-1) I 



(E.2) 



E[X’"(1-X)®] 



r(a-hg) r (g+r) F (B+s) 
r (a) r ( 6 ) r (a+s+r+s) 



(E.3) 



and the mean and variance follow 



(E.4) 
(E.5) 

Moments of negative order exist if a is large enough. If 
a > r, B > s then 



U = 



a + B 



o 



aB 

(a+6)^ (giB+1) 
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. -r, ^ r(g-fB) r(g-r) 
’ r(g) r(g-fe-r) 



(E.5) 



E(X ’^)(1-X) 



r(g+6) r(g-r) r(B~s) 
r(g) r(6) r(g+e-r-s) 



(E.7) 



and the harmonic mean option will be available, 




1 + 



B 

g-1 



1 < g 



Var(i) 



B(g+B-1) 
(g-1)^ (g-2) 



2 < g 



and if g > 1, 6 > 1 





(g+B-1) 
(g-1) (B-1) 



(E.8) 



(E.9) 



(E.IO) 



Symmetric Beta 

Set g = 6 and obtain 



y = 1/2 



a = 



4(2g+l) 



E(|) 



2a-l 

a-1 



1 < a 



var(i) . 

(g-1)^ (g-2) 



2 < g 



= _ (2a-l) 

Cov(x» 2 

(g-1) 



1 < g 



(E.ll) 

(E.12) 

(E.13) 

(E.14) 

(E.15) 
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APPENDIX F 



APL PROGRAMS 



The function HARP performs the iteration (3.7) to estimate the 
Poisson parameter from the harmonic mean. The left argument X is the 
initial value (usually x) and the right argument is y of (3.6). The 
function HMV computes the variance of y using (B.7) and (B.8). The 
right argument is the set of parameter values, A. 



V L*-X HARP Y\LL 

[ 1 ] L^X 

[2] L1:LL*-L 

[3] 

[4] L 

[5] ( I L-LL )>0. 0001 

V 



V Y^HMV 

[ 1 ] 5 0 

C 2 ] V*-{ Li*l)o.*\N 

[3 ] D*-{ (M*-l tp P ) ,/V ) p ( *- i 77) X ( \ 77 ) X ; i 

[4] ,M) p*-L 

[5] ['*■ + / By Vi D 

[ 6 ] V*-ViL 

[7] V*-V-( (1- *-L) iL) *2 

V 



The function MONT produces E(X ) for the symmetric beta distri^ 
bution using (E.2) with a = 3. The left argument is the (integral) order 
of the moment and the right argument is the parameter. The function VAR 
computes the variance of the estimator, a of (3.16) using (3.17) and 
(A. 2). Again the argument is the parameter 
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7 Z^R MONT A;N;I 

[ 1 ] J-t-o 

[2] ZH N ,N^qA)p1 

[3] LliI^I+1 

[4] Z-*-Z^(^iN ,N)pA + I-l) iAo . -i-A+I-l 

[5] 

V 



V V^VAR A 

[1] 7-»-( 4 MONT 4)-(4x(3 MONT A ) x(l MONT /l))-(3x(2 MONT /I ) *2 ) - 4x ( 4+ 8x^ ] 

[2] F-»-4x Kx ( 1 + 2 x/1 ) *4 
7 

The polygairana functions are computed using PSI and JEX. When the 
(scalar) left argument, N, is zero the psi function is produced. Integral 
values of N index the order of the derivative of psi. The argument of 
the function appears on the right. The technique comes from the asymptotic 
expansions in Abramowitz. 



V P-^N PSI IiCiIV;JIViK\KK-,IY;ViZ;TiI 

[1] n N IS THE ORDER OF THE DERIVATIVE OF THE PSI FUNCTION 

[2] fl Y (>0) IS THE ARGUMENT , SCALAR OR VECTOR 

[3] C-^10 

[4] IV*-\pY*-,Y 

[5] P*-Z-^K^{pY)pQ 

[ 6 ] KK^\C-YiJIVHV-^Y<C)/ IV\ 

[7] ->-Llxi (pjy)=p(~y)/i7 

[ 8 ] T*-YiJIV'^ 

[9] I-^Q 

[10] L2:I*-I-H 

[11] j]pT[j] 

[12] Z[I]^( .'W)x+/( (yy-1 ) + i/^;i[I]) *-l+;v 

[13] -*L2^\I<pJIV 

[14] Z^K\Z[ip«7lW] 

[15] K*-V\KK 

[16] Z,1 :->51xx Af>0 

[17] P-H-(a^: + y)-( 2xj!^+y) *-i 

[18] -►2+J2 6 

[19] S1\P*-{{1N-1)^{ Y+K)*-N)-^( ;/V)xO. 5x( Y+K) *-Ni-l 

[20] PHi~l)*N + l)^P+Z+N JEX Y+K 

V 
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V JEX 1 \C;M-,F\E\A 

[1] USED IN THE POLYGAMMA FUNCTIONS 

[2 ] C-'C-f-C 5,(7J5),(5f7),(li:-25).(5x455nix691),(6Sli 7x 455)) 
[3] ,6)p(7 

[ 4 ] A/<*- 7 

[5] FH y*2 )o , x( 2+2x1 W-1 )x( 1 + 2x1 W-1 ) t(//+2xiW- 1 ) x(A’tl + 2x I W-1 ) 
[63 E*-C X F 

[7] /!-( 7+6 )x( y*-A'+2x,Vf )x( .« ~l + A'+2x/^) i( 2x/./) 

[8] c7'-^4x1 + £'[;6Jx 1+£’[ ;5]xl+£'[;4]xl+£'[;3]xl+£'[ ;2]xl+£’[; l] 

V 



The efficiency of the usual moment estimator (4.5) of gamma distri- 
bution parameters is produced by EFF using (4.8) and (4.14). The efficiency 
of (4.10) is computed by EFFH using (4.13) and (4.14), and the relative 
efficiency (4.15) is the function REFF. Only the parameter ct is needed 
and it is entered on the right for all three. It must be >2 for the 
latter two functions. 



V E^EFF Y 

[1] £'^(2x (y+i )x ( ( Jx ( 1 PSI Y ))-!))*-! 

V 



V E*-EFFH Y 

[ 1 ] £’->-2 + ( J -2 )+( I-l )*2 

[2] £--^-E’x(yxi PSI I)-l 

[3] £’-£■*-1 

V 



V Z-^REFF A 

[1] Z^(/l -1 )>^(-4-l ) M-4 + 1 )x4 -2 

V 
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We turn now to the programs that support the negative binomial 
distribution. The function MM computes the ordinary method of moments 
estimators p , r of (4.25). The left argument is x and the right 
argument is s^. 

V M^X MM S 

[ 1 ] P^XrS 

[2] X*2)^S-X 

V 

The determinant of the asymptotic covariance matrix of p, r is produced 

(see (4.29)) by DM, whose left and right arguments are r and p. 

2 

The function PSIB computes x (= XB) , s (= S) , y (= Y) , the 
estimators p, r (using MM), and the maximum likelihood estimate p, r 
by applying Newton’s method to (4.18) using p, f for initialization. 

The left argument, F, is the vector of observed frequencies corresponding 
to the right argument J, the vector of variate values. 



V Z^F PSIB J \PE\PHD;Z\ZZ 
Cl ] XB^(+/JxF)t+/F 

[2] S*-{ +/?)-! )x( +/ F^J*2)-i+/F)^XB*2 

[3] YH+/FiJ+l)i + /F 

[ 4 ] XB MM S 

[ 5 ] R,P 

[6] L1:PP-^P 

[7] Z-^+/(0 P5I P+<7)xp^+/p 

[8] ZZ-^+/(l PSI Pi-J)^Fi+/F 

[9] PH^Z -(.0 PSI P) -(aP) -aP + A'P 
CIO] PPP-^ZZ-d PSI R)-{iR)-iR+XB 

[11] R^R-PHiPHD 

[ 12 ] P^RiR-^XB 

[13] P,P 

[14] ->-Llxt ( I pp-p)>o. 0 0 01 

V 
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The function INFl computes the difficult element of the information matrix, 



^22 (4.20), using (D.26). The left and right arguments are r and p, 
resp. The vector r must be whole numbers except it will also handle the 
values .5, 1.5, ... ,4.5. 



7 Z-Hif P-,N-,A-,B\C\ZZ\J 

C 1 ] A i M 

[2] c<-( 1 -P-H,p)o . *// 

[3 ] (pA’) ,pP^,P)pO 

[4] ^^0 

C 3 3 Ij \ I tj ^rj + 1 

[6] A+PCJJ )>55 

[7] ->L2xi (pA) =pAA■^■(~^^)/ A 

[8] pAl-l'/ A)p 0 

[9] 4CAl;JJ^x/( ZZo.txP[J-]-l)4Alo.+,p[^]-l 

[10] ->L3xi (pAA)=0 

[11 ] L2\A [AA ;c7]^( AA) x( .»P[J]-l ) v.' AA+P[A ]-l 

[12] LZx -*Ll'<\J < qR 

[13] S-^C)( (pP) ,pA)pA*2 

[14] Z-^-C+.x^^S 

V 



The function DETI and DI both compute the determinant of the information 
matrix, but the former uses INFl in (4.19) and can handle the same set 
of r values which are whole numbers plus .5. The latter, DI , uses (D.25) 
and all values of r must be whole numbers. 



V V^R DETI P 

[1] V^(R INFl P)x<5( , 4( P*2 )xi -p 

[2] (pP) ,pP)pP>-2 

V 
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V D^R DI PiA\B-,C\N\Pl\J \Z 
[ 1 ] N^xM 

[2] C^il-P-^ ,P)o . 

[3] P1^6?( (pi?^,i?) .pP)pP*-2 

[4] (pAT) ,pP^.P)pO 
C5] Z-^-(pAT)pO 

[6] J-^Q 

[7] + 1 

C8] ALiJl-^x/iZo . + iP[J] .+IPCJ-] 

[9] -*LlxxJ<pR 

[10] (p R) tP N)p N+1 

[11] D*~ Pl^D -*-C + . ^AiB 

V 



The determinant (4.29) of the asymptotic covariance matrix (4.28) 
is computed by the function DM 



V D^R DM P 

[1] P-^2x(l+p-t-,P)o.x(p*2)n-P-^,P 

7 



and the efficiency of the moment estimator is NBEF 



V P-^-P NBEF P 

[1] E*-i{R DM P)>^<SfR DETI P 

V 



The function HAS. implements the iteration scheme described in 
(4.32) and produces the harmonic mean based alternative estimators p , r 
from the system (4.30). The left argument is x and the right argument 
is y . 

V P^X EAR Y\PP\C-,GiGP\Z 

[1] P-»-$-f-0.5 

[2] C*-Y-1-Y^X 

[3] R*-X 

[4] L1-.PP*-P 

[5] G*-{Z*-P*R)-Y-CxP 

[6] GP*-C-\-Z^X^{Q*9P)iQ*2 

[7] P-^-P-GiGP 

[8] R^X^PiQ^l-P 

[9] R.P,G 

[10] -^-Llxi ( I (?)>1P~6 

V 



60 



The Var(l/1+X) is computed by VHMl using (D.14) and (D.15) for r values 
that are whole numbers, and using (D.17) and (D.19) for r values that 
are half way between whole numbers. 



7 Z*-R VHMl •,PP\RR-,S\H\HH\HHH-,M\U1\U2-,J -JhJJH -,Q-,SS 

[1] PI VAR OF FOR NEC BII1{R,P)\R MUST BE WHOLE OR WHOLE PLUS .5 

[ 2 ] RH~U2*-R=0.S)/ R^{~U1*-R = 1) / R 

[3] Z -^5 -<-(( •<-,/?), ?P‘*-pP-^ , P )p 0 

[4] -*-L3xi(pP)=0 

[5] 1^0 

[6] 

[7] 

[a ] ->( 3 +X26 ) X I s ^LP 

[9] B-^B-1 

[103 5[J;]-^-oP 

[11] -*■( 2 tl2 6 ) X I S =LS 

[1 23 5[I; 3x-2x« 2 vl + P*0. 5 

[133 BBH PP ,lB)pRLll-l + \lB 

[143 /4>lx-( po . *-/?[73-i +iLP) -1 

[153 SLIil^SU;l+ + /AAiBB 

[16 3 ->L 1 X I J< L PP-»-p P 

[173 P3:P->-(~i/2)\P 

[183 ->-L4xi(pP) = 0 

[193 P[i/2/ i PPx-pp+ + /i/2 3^C . 5 

[ 2 03 5x-(~P2)\[13 S 

[213 -^( 2+l26)xi 0 = + /i/2 

[223 P[£/2/iPP;3x-t(2xa2Tl + P*0.5)t2x(P*0.5)-l 

[23 3 Z->-Sx( P^5?P° . *R) UlHH^i P-1 )«> . XI -P 

[243 (HH-( RR ,PP)pP) -H) iHHH 

[ 2 5 3 Z^Z-M*2 

[ 26 3 L4;Px-( 2x950iPxl 000) to^-^l-P 
[2 73 7^[ ( 50)/ i PP3-^50 

[2 8 3 SS^PPpO 
[2 9 3 U*-0 

[ 3 0 3 L2:J-^J+1 

[313 PSx- + /( Qo . PA’) t( PP,//P)p ( iP/V^r /yfJ3 ) *2 

[323 -*L2^\J<PP 

[33 3 -»•( 1 +X2 6 ) x( +/ PI ) 

[34 3 P-(~P1)\P 

[3 5 3 P[Pl/iPP->-PP + + /Pl 3-^1 
[36 3 Z--(~P1)\[13 Z 

[ 3 7 3 Z[Pl/iPP; 3^?x(SS-Px( {^P ) *2) i Q) Q 
7 
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The function DMSI computes the determinant of M from (4.36). Two 
auxiliary functions are needed: G22 provides and from the 

second row of A in (4.33) (compare (2.17)) and DMSI is needed to handle 
values of r = 1, special handling being required becuase of (4.34). 



V X^R DMSI P\RR;PP;HHiC;LiG;G71;Z-.U 
Cl] AV( (p/?) ,pP)pO 

[ 2 ] R^{~U->-R = l)/ R 

[3] Z-<-{L-^R<‘ . i(P*2)xQ-<-l~P)xG-*-^R G22 P 

[4] ZH Z+(<^G21 (RR-<-pR) ,PP^pP)pP)*2 

C5] Z^2t(LxC-^( ( (P-1 )o .x§)*2 )xp VHMl P)-fi?G21*2 

[6] J[(~(/)/ xPP++/i/; 

[7] -*-(2 + l2 6)xi0=+/(/ 

[8] XLU/\RR + +/U P 

7 



Cl] 

C2] 



V Z^DMSl P\W 

Z->-{ ( 0 . 5 x( ®p ) *2 ) + J/-*-l -P-<»P) *2 

2-^Zt( ( (pP)pi P)x( (i-p)*3) 



7 Z^R G22 P\PP\RR\T\TT-.H-,HE-,V',W 
Cl] H-*-{ ( PP-*-p P -*- , P ) , RR-^p R-*- ,R )p Po . *p 

C2] PP-fi)(PP ,PP)pP 
C3] Z^HH-H 
C4] V'^P = 1 
C5] TTH~V)/xRR 

C6] r-^PxCj(pp, pp)pap 

C7] ZL;TT2->-TliTT2 + ZLiTT2i(pZL iTT p(~ V ) / R-l 

Ca] G21H {PP ,RR)pR)^HiHH 
C9] G21^(721-( 1-P)t< 9(PP ,Pp)pi-p 

7 



62 



The efficiency of the harmonic mean based estimator p , r is computed 
by the function EFF 

V EFF P 

[1] E^iR DMSI DETI P 

V 

Both EFF and NBEF used DETI and, for that reason, are restricted to only 

* * 

a few fractional values of r. The relative efficiency of p , r with 
respect to p, r is not so restricted. It is computed by RELEF and accepts 
any r > 0. 

V E^R RELEF P 

Cl] E^(iR DM P)x{ R DMEI P ) ) 

V 



Methods for computing the efficiency of beta distribution estimators 
are supported by the following programs: The determinant of the information 

matrix (4.39) is computed by the functions DINF and DDINF. The former takes 
a single (vector) argument and produces a symmetric, square matrix of 
values |a| for all pairs of components of the arguments. If the arguments 
a and B must be entered separately, then the two argument DDINF can be 
used . 



V L^DINF A\R\B\C 

Cl] . +/] 

[2] B*-i N ,N^pA)0l PS I L 

[3] . xc )- (Co . PSI A)^B 

V 



V L*-A DDINF B\M-,N ^CA-.CB^D 

[ 1 ] 

[2] D*-{{M^pA) ,:i*-p3)pl PSI L 

[3] L^iCAo .^CB)-i {CA-1 PSI /l)o.+C5-l PSI 3)^D 



V 
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The function DETMM accepts two by two matrices A (on the left) 



of (2.7). The function ADET uses it to produce an array of such 
determinants. The arguments H and C are f our-dimensional and may be 
thought of as an M by N array of 2 by 2 matrices. The left set, H, 
are the coefficient arrays A of (2.2) and the right set, C, are the 
covariances (2.6) 



V MM^H ADET 

[1 ] MMH fpff )p0 

[2] I^J^Q 

[ 3 ] 

[ 4 ] J^O 

[ 5 ] 

[ 6 ] ;I;J1 DETMM 

[ 7 ] ^L2^\J<N 

[ 8 ] <M 



The computation of the efficiency of the ordinary moment estimator 



(4.42) requires the coefficient array (4.43) and the convariance array 
(4.45). The former is computed by the function COEFM and the latter by 
COVM. Each takes a single vector argument and computes the required 
values for all pairs of components in the argument. The function COVM 
requires the beta distribution moments (E.2) and these are computed by 



and C (on the right) and computed the determinant 




V M^A DETMM C 

Cl] )+.xc?c[3^ 

[2] M^(/-/[1;1]x,1/[2;2])-W[1;2]*2 



V 



V 



MONT. 
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V H*-COEFM 

Cl] /y^( 2, 2,/V,.V<^p>4 )pl 

[2] y^Cl ; 1 ; ; ]^(A’,/V )p->3 

[3] //[I; 2; ; ]^(y/,A)p4 

[4] o . )x (;4 o . +/i )iiA° . i-A+l) 

[5] y^C2;l; ;]^B + (^o. -A )><U!,N)pA 

[6] //[2; 2; ; ]^B+( ( -4 ) » . +/1 ) ( A' , A' ) p^ 

[7] H->-HH2,2,N ,N)pA'= .+A 

V 



V z-/? A/OAr A;i;;I 
Cl] 1^0 

C 2 ] Z^( A’ ,/y^pAl )pl 

[3] 

C 4 ] Z-*-Zx( 6 )(.V ,/V )pyl+I-l )iAo. +.4 tI -1 

C5] -^Llxi7<A? 

V 



All these are utilized by EFBM which computes the efficiency (2.22). 

The output is a symmetric matrix. The argument must have positive components. 



V E^EFBM A\L\C xHxM 
Cl] L^DINF A 

[2] C^COVM A . 

[3] H^COEFM A 

[4] ABET C 

[5] E<-^MxL 

V 



The efficiency of (4.48) is handled in similar fashion. (Also 
with single arguments.) An array of 2 by 2 matrices (4.49) is produced 
by the function COEF and the matching matrices (4.50) by COV. These are 
used by EFBH to compute a symmetric matrix of efficiencies. The argument 
must have all components > 2. 
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V H->-CO EF A;N 

[1] H*-i2 ,N^pA)p-l 

[2 ] ^i'[2; 2; ; ]*-A <> ,iA-l 

[3] ;]-fi)//[2;2; ;] 

V 



\ 


7 C*-C0V A 


\H 








Cl] 


CH 2.2, 


N A ) pi 






C2] 


CCl; 1 ; ; 


]-^( 4 ® . 


+ A - 


1 


) xO/1 o , iA-2 


C3] 


6’Cl;2; ; 


]^CC2; 


1 ; ; 


] 


*--A o . +A -1 


C4] 


CC2;2; ; 


]-*-(/} O . 


+A- 


1 


)x(.Ao . iA -2 ) 



V 



V E*-EFBH A;C-,H-,L 

[1] L^DINF A 

[2] H^COEF A 

[3] C^COV /I 

[4] M-^H ADET C 

[5] 

V 



The estimator (4.52) is managed in like fashion, only this time 



the arguments a (left) and 6 (right) must be entered separately with 
a > 2, S > 0. The coefficients (4.53) are computed by the function COEFH 
and the covariances (4.54) by COVH. These are drawn on by EFBMH to produce 
the efficienices 



V H*-A COEFH 

Cl] 2,(.V^p/l ) ,A'-pS)p-l 

[2 ] .■/[ 1 ;1 ; ; ]>^^(/lo . +S)t(A/,A' )p -B 

[3] Hil\2-, ,M)pA) iAo .-^B 

[4] ^/[2;1; ;]^r(/l-l )o. vfl 

V 
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Cl] 

[ 2 ] 

[3] 

[4] 



Cl] 

C2] 

C3] 

C4]- 

C5] 



C*-A COVH BiN;M 
C^{2,2,iM^pA ) ,N^pB)pO 
CLl;l ;;]^(/5o.x5)i^o.+B+i 
CCl; 2; ; ]^C-C2;1; ; ]^(W,iV)p-B 
CC 2; 2; ; ^■<r-iA ■> . +B - 1 ) i{A -2 ) <> , iB 



7 E^A EFBMH B\C\H',M',L 
L*-A DDINF B 
H^A COEFH B 
C*-A COVH B 
M*-H ADET C 
L 
V 



« 
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